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When physically realized, quantum information processing (QIP) can be used to solve problems in 
physics simulation, cryptanalysis and secure communication for which there are no known efficient so- 
lutions based on classical information processing. Numerous proposals exist for building the devices 
required for QIP by using a variety of systems that exhibit quantum properties. Examples include nuclear 
spins in molecules, electron spins or charge in quantum dots, collective states of superconductors, and 
photons [1]. In all of these cases, there are well established physical models that, under ideal conditions, 
allow for exact realizations of quantum information and its manipulation. However, real physical systems 
never behave exactly like the ideal models. The main problems are environmental noise, which is due to 
incomplete isolation of the system from the rest of the world, and control errors, which are caused by cali- 
bration errors and random fluctuations in control parameters. Attempts to reduce the effects of these errors 
are confronted by the conflicting needs of being able to control and reliably measure the quantum systems. 
These needs require strong interactions with control devices, and systems sufficiently well isolated to 
maintain coherence, which is the subtle relationship between the phases in a quantum superposition. The 
fact that quantum effects rarely persist on macroscopic scales suggests that meeting these needs requires 
considerable outside intervention. 

Soon after P. Shor published the efficient quantum factoring algorithm with its applications to break- 
ing commonly used public-key cryptosystems, A. Steane [2] and P. Shor [3] gave the first constructions of 
quantum error-correcting codes. These codes make it possible to store quantum information so that one 
can reverse the effects of the most likely errors. By demonstrating that quantum information can exist in 
protected parts of the state space, they showed that, in principle, it is possible to protect against environ- 
mental noise when storing or transmitting information. Stimulated by these results and in order to solve 
the problem of errors happening during computation with quantum information, researchers initiated a se- 
ries of investigations to determine whether it was possible to quantum-compute in a fault-tolerant manner. 
The outcome of these investigations was positive and culminated in what are now known as "accuracy 
threshold theorems" [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]. According to these theorems, if the effects of 
all errors are sufficiently small per quantum bit (qubit) and step of the computation, then it is possible to 
process quantum information arbitrarily accurately with reasonable resource overheads. The requirement 
on errors is quantified by a maximum tolerable error rate called the threshold. The threshold value depends 
strongly on the details of the assumed error model. All threshold theorems require that errors at different 
times and locations be independent and that the basic computational operations can be applied in parallel. 
Although the proven thresholds are well out of range of today's devices there are signs that in practice, 
fault-tolerant quantum computation may be realizable. 

In retrospect, advances in quantum error correction and fault-tolerant computation were made possible 
by the realization that accurate computation does not require the state of the physical devices supporting 
the computation to be perfect. In classical information processing, this observation is so obvious that it is 
often forgotten: No two letters "e" on a written page are physically identical, and the number of electrons 
used to store a bit in a computer's memory varies substantially. Nevertheless, we have no difficulty in 
accurately identifying the desired letter or state. A crucial conceptual difficulty with quantum information 
is that by its very nature, it cannot be identified by being "looked" at. As a result, the sense in which 
quantum information can be accurately stored in a noisy system needs to be defined without reference 
to an observer. There are two ways to accomplish this task. The first is to define stored information to 
be the information that can, in principle, be extracted by a quantum decoding procedure. The second is 
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to explicitly define "subsystems" (particle-like aspects of the quantum device) that contain the desired 
information. The first approach is a natural generalization of the usual interpretations of classical error- 
correction methods, whereas the second is motivated by a way of characterizing quantum particles. 

In this introduction we motivate and explain the "decoding" and "subsystems" view of quantum error 
correction. We explain how quantum noise in QIP can be described and classified, and summarize the 
requirements that need to be satisfied for fault tolerance. Considering the capabilities of currently available 
quantum technology, the requirements appear daunting. But the idea of "subsystems" shows that these 
requirements can be met in many different, and often unexpected ways. 

Our introduction is structured as follows: The basic concepts are introduced by example, first for 
classical and then for quantum codes. We then show how the concepts are defined in general. Following 
a discussion of error models and analysis (Sect. 4), we state and explain the necessary and sufficient 
conditions for detectability of errors and correctability of error sets (Sect. 5). This is followed by a brief 
introduction to two of the most important methods for constructing error-correcting codes and subsystems 
(Sect. 6). For a basic overview, it suffices to read the beginnings of these more technical sections. The 
principles of fault-tolerant quantum computation are outlined in the last section. 



1 Concepts and Examples 



Communication is the prototypical application of error-correction methods. To communicate, a sender 
needs to convey information to a receiver over a noisy "communication channel". Such a channel can be 
thought of as a means of transmitting an information-carrying physical system from one place to another. 
During transmission, the physical system is subject to disturbances that can affect the information carried. 
To use a communication channel, the sender needs to "encode" the information to be transmitted in the 
physical system. After transmission, the receiver "decodes" the information. The procedure is shown in 
Fig. 1. 
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FIG. 1: A typical application of error-correction methods: The illustration shows the three main steps 
required for communication. Information is first encoded in a physical system, then transmitted over the 
noisy communication channel and finally decoded. The combination of encoding and decoding is chosen 
so that errors have no effect on the transmitted information. 

The protection of stored information is an other important application of error-correction methods. In 
this case, the user encodes the information in a storage system and retrieves it at a later time. Provided 
that there is no communication from the receiver to the sender, any error-correction method applicable 
to communication is also applicable to storage and vice versa. In Sect. 7 we discuss the problem of 
fault-tolerant computation, which requires enhancing error-correction methods in order to enable applying 
operations to encoded information without losing protection against errors. 

To illustrate the different features of error-correction methods we consider three examples. We begin 
by describing them for classical information, but in each case, there is a quantum analogue that will be 
introduced later. 

1.1 Trivial Two-Bit Example 

Consider a physical system consisting of two bits with state space {oo, oi, lo, ii}. We use the convention 
that state symbols for physical systems subject to errors are in brown. States changed by errors are shown 
in red' . In this example, the system is subject to errors that flip (apply the not operator to) the first bit 
with probability .5. We wish to safely store one bit of information. To this end, we store the information 
in the second physical bit, because this bit is unaffected by the errors (Fig. 2). 

'These graphical conventions are not crucial for understanding what the symbols mean and are for emphasis only. 
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Physical system and error model 



Usage examples. 

Store in the second bit: 




Store 1 in the second bit: 
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FIG. 2: A simple error model. Errors affect only the first bit of a physical two bit system. All joint 
states of the two bits are affected by errors. For example, the joint state oo is changed by the error to lo. 
Nevertheless the value of the information represented in the second physical bit is unchanged. 

As suggested by the usage example in Fig. 1, one can "encode" one bit of information in the physical 
system by the map that takes o ^ oo and i — > oi. This means that the states o and i of an ideal bit are 
represented by the states oo and oi of the noisy physical system, respectively. 
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To "decode" the information one can extract the second bit by the following map: 



00 
10 
01 

11 






1 
1 



(1) 



This procedure ensures that the encoded bit is recovered by the decoding regardless of the error. There are 
other combinations of encoding and decoding that work. For example, in the encoding, we could swap 
the meaning of o and i by using the map o ^ oi and i oo. The new decoding procedure adds a bit 
flip to the one shown above. The only difference between this combination of encoding/decoding and the 
previous one lies in the way in which the information is represented inside the range of the encoding. The 
range consists of the two states oo and oi and is called the "code". The states in the code are called "code 
words". 

Although trivial, the example just given is typical of ways for dealing with errors. That is, there is 
always a way of viewing the physical system as a pair of abstract systems: The first member of the pair 
experiences the errors and the second carries the information to be protected. The two abstract systems 
are called "subsystems" of the physical system and are usually not identifiable with any of the system's 
physical components. The first is the "syndrome" subsystem and the second is the "information-carrying" 
subsystem. Encoding consists of initializing the first system and storing the information in the second. 
Decoding is accomplished by extraction of the second system. In the example, the two subsystems are 
readily identified as the two physical bits that make up the physical system. The first is the syndrome 
subsystem and is initialized to o by the encoding. The second carries the encoded information. 



1.2 The Repetition Code 

The next example is a special case of the main problem of classical error-correction and occurs in typical 
communication settings and in computer memories. Let the physical system consist of three bits. The 
effect of the errors is to independently flip each bit with probability p, which we take to be p = .25. The 
repetition code results from triplicating the information to be protected. An encoding is given by the map 
000, 1 — i> 111. The repetition code is the set {ooo, iii}, which is the range of the encoding. The 
information can be decoded with majority logic: If the majority of the three bits is o, output o, otherwise 
output 1. 

How well does this encoding/decoding combination work for protecting one bit of information against 
the errors? The decoding fails to extract the bit of information correctly if two or three of the bits were 
flipped by the error. We can calculate the probability of incorrect decoding as follows: The probability of 
a given pair of bits having flipped is .25^ * .75. There are three different pairs. The probability of three bits 
having flipped is .25^. Thus the probability of error in the encoded bit is 3 ■ .25^ * .75 + .25'^ = 0.15625. 
This is an improvement over .25, which is the probability that the information represented in one of the 
three physical bits is corrupted by error. 

To see that one can interpret this example by viewing the physical system as a pair of subsystems, it 
suffices to identify the physical system's states with the states of a suitable pair. The following shows such 
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a "subsystem identification": 
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(2) 



The left side consists of the 8 states of the physical system, which are the possible states for the three 
physical bits making up the system. The right side shows the corresponding states for the subsystem pair. 
The syndrome subsystem is a two bit subsystem, whose states are shown first. The syndrome subsystem's 
states are called "syndromes". After the "■" symbol are the states of the information-carrying one-bit 
subsystem. 

In the subsystem identification above, the repetition code consists of the two states for which the 
syndrome is oo. That is, the code states ooo and iii correspond to the states oo ■ o and oo ■ i of the 
subsystem pair. For a state in this code, single-bit flips do not change the information-carrying bit, only 
the syndrome. For example, a bit flip of the second bit changes ooo to oio which is identified with oi ■ o. 
The syndrome has changed from oo to oi. Similarly, this error changes iii to loi ^ oi ■ i. The following 
diagram shows these effects: 



000 ^ 00-0 

i 

010 01-0 



111 ^ 00-1 

i 

101 01 ■ 1 



(3) 



Note that the syndrome change is the same. In general, with this subsystem identification, we can infer 
from the syndrome which single bit was flipped on an encoded state. 

Errors usually act cumulatively over time. For the repetition code this is a problem in the sense that 
it takes only a few actions of the above error model for the two- and three-bit errors to overwhelm the 
encoded information. One way to delay the loss of information is to decode and re-encode sufficiently 
frequently. Instead of explicitly decoding and re-encoding, the subsystem identification can be used di- 
rectly for the same effect, namely, that of resetting the syndrome subsystem's state to oo. For example, 
if the state is lo ■ i, it needs to be reset to oo ■ i. Therefore, using the subsystem identification, resetting 
the syndrome subsystem requires changing the state on to iii. It can be checked that, in every case, 
what is required is to set all bits of the physical system to the majority of the bits. After the the syndrome 
subsystem has been reset, the information is again protected against the next one-bit error. 



1.3 A Code for a Cyclic System 

We next consider a physical system that does not consist of bits. This system has seven states symbolized 
by 0, 1, 2, 3, 4, 5 and 6. Let Si be the right-circular shift operator defined by Si(/) = / + 1 for < / < 5 
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and si(6) = 0. Define sq = 1 (the identity operator), 



k times 




FIG. 3: A seven state cyclic system. The position of the pointer on the seven-position dial determines the 
state of the system. With the pointer in the position shown, the state is 1. Errors have the effect of rotating 
the pointer clockwise (to the "right") or counter-clockwise (to the "left"). The effect of si is to rotate the 
pointer clockwise as shown by the red arrow. 

Suppose that the errors consist of applying Sk with probability qe~^^ , where q = 0.5641 is chosen so that 
the probabilities sum to 1, that is XlfeL-oo ^ ^- Thus sq has probability 0.5641, and each of s_i and 
si has probability 0.2075. These are the main errors that we need to protect against. Continuous versions 
of this error model in the context of communication channels are known as "Gaussian channels". 
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One bit can be encoded in this physical system by the map o ^ 1, i — 4. To decode with protection 
against sq, s_i and si, use the mapping: 



1 ^ 
2^0 

3- 1 (5) 

4- ^1 

5 -> 1 

6 ^ fail 

If state 6 is encountered, we know that an error involving a shift of at least 2 (left or right) occurred, but 
there is no reasonable way of decoding it to the state of a bit. This means that the error is detected, but we 
cannot correct it. Error detection can be used by the receiver of information to ask for it to be sent again. 
The probability of correctly decoding with this code is at least 0.9792, which is the probability that the 
error caused a shift of at most one. 

As before, a pair of syndrome and information-carrying subsystems can be identified as being used 
by the encoding and decoding procedures. It suffices to correctly identify the syndrome states, which we 
name — i, o and i, because they indicate which of the likeliest shifts happened. The resulting subsystem 
identification is 

— -1-0 

1 ^ 0-0 

I " (6) 

3 — 1 ■ 1 

4 <-> 0-1 

5 ^ 1-1 

A new feature of this subsystem identification is that it is incomplete: Only a subset of the state space is 
identified. In this case, the complement can be used for error detection. 

Like the repetition code, this code can be used in a setting where the errors happen repeatedly. Again 
it suffices to reset the syndrome subsystem, in this case to o, to keep the encoded information protected. 
After the syndrome subsystem has been reset, a subsequent si or s_i error affects only the syndrome. 



2 Principles of Error Correction 

When considering the problem of limiting the effects of errors in information processing, the first task is 
to establish the properties of the physical systems that are available for representing and computing with 
information. Thus it is necessary to learn the following: 

1 . The physical system to be used, in particular the structure of its state space. 

2. The available means for controlling this system. 

3. The type of information to be processed. 

4. The nature of the errors, that is, the error model. 
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With this information, the approaches used to correct errors in the three examples provided in the previous 
section involve the following: 

1. Determine a code, which is a subspace of the physical system that can represent the information to 
be processed. 

2. a Identify a decoding procedure that can restore the information represented in the code after any one 
of the most likely errors occurred. 

2.b Or, determine a pair of syndrome and information-carrying subsystems such that the code corre- 
sponds to a "base" state of the syndrome subsystem and the primary errors act only on the syndrome. 

3. Analyze the error behavior of the code and subsystem. 

The tasks of determining a code and of identifying decoding procedures or subsystems are closely 
related. As a result, the following questions are at the foundation of the theory of error-correction: What 
properties must a code satisfy so that it can be used to protect well against a given error model? How 
does one obtain the decoding or subsystem identification that achieves this protection? In many cases, the 
answers can be based on choosing a fixed set of error operators that represents well the most likely errors 
and then determining whether these errors can be protected against without any loss of information. Once 
an error set is fixed, determining whether it is "correctable" can be cast in terms of the idea of "detectable" 
errors. This idea works equally well for both classical and quantum information. We introduce it using 
classical information concepts. 

2.1 Error Detection 

Error detection was used in the cyclic system example to reject a state that could not be properly decoded. 
In the communication setting, error control methods based on error detection alone work as follows: The 
encoded information is transmitted. The receiver checks whether the state is still in the code, that is, 
whether it could have been obtained by encoding. If not, the result is rejected. The sender can be informed 
of the failure so that the information can be retransmitted. Given a set of error operators that need to be 
protected against, the scheme is successful if for each error operator, either the information is unchanged, 
or the error is detected. Thus we can say that an operator E is "detectable" by a code if for each state x in 
the code, either Ex = x or Ex is not in the code. See Fig. 4 
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FIG. 4: Pictorial representation of typical detectable and undetectable errors for a code. Three examples 
are shown. In each, the code is represented by a brown oval containing three code words (green points). 
The effect of the error operator is shown as arrows. In (A), the error does not change the code words and 
is therefore considered detectable. In (B), the error maps the code words outside the code, so that it is 
detected. In (C), one code word is mapped to another, as shown by the red arrow. Finding that a received 
word is still in the code does not guarantee that it was the originally encoded word. The error is therefore 
not detectable. 

What errors are detectable by the codes in the examples? The code in the first example consists of 
00 and oi. Every operator that affects only the first bit is therefore detectable. In particular, all operators 
in the error model are detectable. In the second example, the code consists of the states ooo and iii. 
The identity operator has no effect and is therefore detectable. Any flips of exactly one or two bits are 
detectable because the states in the code are changed to states outside the code. The error that flips all bits 
is not detectable because it preserves the code but changes the states in the code. With the code for the 
cyclic system, shifts by —2, —1, 0, 1, 2 are detectable but not shifts by 3. 

To conclude the section, we state a characterization of detectability, which has a natural generalization 
to the case of quantum information. 

Theorem. E is detectable by a code if and only if for all x ^ y in the code. Ex ^ y. (7) 



2.2 From Error Detection to Error Correction 

Given a code C and a set of error operators S = {1 = Eq, Ei, E2, . . .} is it possible to determine whether 
a decoding procedure or subsystem exists such that £ is "correctable" (by C), that is, such that the errors 
in £ do not affect the encoded information? As explained below, the answer is yes and the solution is to 
check the condition in the following theorem: 

Theorem. £ is correctable by C if and only if for all x ^ y in the code and all i, j, it is 

true that EiX 7^ Ejy. (8) 

Observe that the notion of correctability depends on all the errors in the set under consideration and, unlike 
detectability, cannot be applied to individual errors. 
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To see that the condition for correctability in Thm. 8 is necessary, suppose that for some x ^ y in the 
code and some i and j, we have z = EiX = Ejy. If the state z is obtained after an unkown error in 6, then 
it is not possible to determine whether the original code word was x or y, because we cannot tell whether 
Ei or Ej occurred. 

To see that the condition for correctability in Thm. 8 is sufficient, we assume it and construct a decoding 
method z dec(z). Suppose that after an unknown error occurred, the state z is obtained. There can be 
one and only one x in the code for which some Ei(^z) £ £ satisfies the condition that Ei(^z)X = z. Thus x 
must be the original code word and we can decode z by defining x = dec(2;). Note that it is possible for 
two errors to have the same effect on some code words. A subsystem identification for this decoding is 
given by z ^ i{z) ■ dec(2;), where the syndrome subsystem's state space consists of error operator indices 
i{z), and the information-carrying system's consists of the code words dec(2;) returned by the decoding. 
The subsystem identification thus constructed is not necessarily onto the state space of the subsystem pair. 
That is, for different code words x, the set of i{z) such that dec(z) = x can vary and need not be all of 
the error indices. As we will show, the subsystem identification is onto the state space of the subsystem 
pair in the case of quantum information. It is instructive to check that, when applied to the examples, this 
subsystem construction does give a version of the subsystem identifications provided earlier. 

It is possible to relate the condition for correctability of an error set to detectability. For simplicity, 
assume that each E^ is invertible. (This assumption is satisfied by our examples, but not by error operators 
such as "reset bit one to o".) In this case, the correctability condition is equivalent to the statement that all 
products Ej^Ei are detectable. To see the equivalence, first suppose that some Ej^Ei is not detectable. 
Then there are x ^ y in the code such that Ej^EiX = y. Consequently EiX = Ejy and the error set is not 
correctable. This argument can be reversed to complete the proof of equivalence. 

If the assumption that the errors are invertible does not hold, the relationship between detectability 
and correctability becomes more complicated, requiring a generalization of the inverse operation. This 
generalization is simpler in the quantum setting. 

3 Quantum Error Correction 

The principles of error correction outlined in Sec. 2 apply to the quantum setting as readily as to the 
classical setting. The main difference is that the physical system to be used for representing and processing 
information behaves quantum mechanically and the type of information is quantum. The question of how 
classical information can be protected in quantum systems is also interesting but will not be discussed 
here. We illustrate the principles of quantum error correction by considering quantum versions of the 
three examples of Sect. 1 and then add a uniquely quantum example with potentially practical applications 
in, for example, quantum dot technologies. For an explanation of the basic quantum information concepts 
and conventions, see [16]. 

3.1 Trivial Two-Qubit Example 

A quantum version of the two bit example from the previous section consists of two physical qubits, where 
the errors randomly apply the identity or one of the Pauli operators to the first qubit. The Pauli operators 
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are defined by 



ay 




(9) 



Explicitly, the errors have the effect 
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where the superscripts in parentheses specify the qubit that an operator acts on. This error model is called 
"completely depolarizing" on qubit 1. Obviously, a one-qubit state can be stored in the second physical 
qubit without being affected by the errors. An encoding operation that implements this observation is 

\i^)^\o)M\^ (11) 

which realizes an ideal qubit as a two-dimensional subspace of the physical qubits. This subspace is the 
"quantum code" for this encoding. To decode one can discard physical qubit 1 and return qubit 2, which 
is considered a natural subsystem of the physical system. In this case, the identification of syndrome and 
information-carrying subsystems is the obvious one associated with the two physical qubits. 



3.2 Quantum Repetition Code 

The repetition code can be used to protect quantum information in the presence of a restricted error model. 
Let the physical system consist of three qubits. Errors act by independently applying, to each qubit, the flip 
operator ax with probability .25. The classical code can be made into a quantum code by the superposition 
principle. Encoding one qubit is accomplished by 

a|o)+/5|i) ^a|ooo)+/5|iii>. (12) 

The associated quantum code is the range of the encoding, that is, the two-dimensional subspace spanned 
by the encoded states |ooo) and |iii). 

As in the classical case, decoding is accomplished by majority logic. However, it must be implemented 
carefully to avoid destroying quantum coherence in the stored information. One way to do that is to use 
only unitary operations to transfer the stored information to the output qubit. Fig. 5 shows a quantum 
network that accompishes this task. 
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FIG. 5: Quantum network for majority logic decoding into the output qubit 3. The effect of the quantum 
network on the basis states is shown. The top half shows the states with majority o. The decoded qubit is 
separated in the last step. The conventions for illustrating quantum networks are explained in [16]. 

As shown, the decoding network establishes an identification between the three physical qubits and 
a pair of subsystems consisting of two qubits representing the syndrome subsystem and one qubit for 
the information-carrying subsystem. On the left side of the correspondence, the information-carrying 
subsystem is not identifiable with any one (or two) of the physical qubits. Nevertheless it exists there 
through the identification. 

To obtain a network for encoding, we reverse the decoding network and initialize qubits 2, 3 in the 
state |oo). Because of the initialization, the Toffoli gate becomes unnecessary. The complete system with 
a typical error is shown in Fig. 6. 
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Encode Decode 




FIG. 6: Encoding and decoding networks for the quantum repetition code with a typical error. The error 
that occurred can be determined from the state of the syndrome subsystem, which consists of the top 
two qubits. The encoding is shown as the reverse of the decoding, starting with an initialized syndrome 
subsystem. When the decoding is reversed to obtain the encoding, there is an initial Toffoli gate (shown in 
gray). Because of the initialization, this gate has no effect and is therefore omitted in an implementation. 

As in the case of the classical repetition code, we can protect against cumulative errors without explic- 
itly decoding and then re-encoding, which would cause a temporary loss of protection. Instead, one can 
find a means for directly resetting the syndrome subsystem to |oo) (thus returning the information to the 
code) before the errors happen again. After resetting in this way, the errors in the correctable set have no 
effect on the encoded information because they act only on the syndrome subsystem. 

Part of the task of designing error-correcting systems is to determine how well the system performs. 
An important performance measure is the probability of error. In quantum systems, the probability of 
error is intuitively interpreted as the maximum probability with which we can see a result different from 
the expected one in any measurement. Specifically, to determine the error, one compares the output \^o) 
of the system to the input l^p) . An upper bound is obtained if the output is written as a combination of the 
input state and an "error" state. For quantum information, combinations are linear combinations (that is, 
superpositions). Thus \^o) = l\'4') + k) (see Fig. 7). The probability of error is bounded by e = ||e)p 
(which we call an "error estimate"). In general, there are many different ways of writing the output as 
a combination of an acceptable state and an error term. One attempts to choose the combination that 
minimizes the error estimate. This choice yields the number e, for which 1 — e is called the "fidelity". A 
fidelity of 1 means that the output is the same (up to a phase factor) as the input. 
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FIG. 7: Representation of an error estimate. Any decomposition of the output state {tpo) into a "good" 
state ^\tp) and an (unnormalized) error term |e) gives an estimate e = | |e) p. For pure states, the optimum 
estimate is obtained when the error term is orthogonal to the input state. To obtain an error estimate for 
mixtures, one can use any representation of the state as a probabilistic combination of pure states and 
calculate the probabilistic sum of the pure state errors. 

To illustrate error analysis, we calculate the error for the repetition code example for the two initial 
states |o)and^(|o) + |i)). 
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(13) 



The final state is a mixture consisting of four correctly decoded components and four incorrectly decoded 
ones. The probability of each state in the mixture is shown before the colon. The incorrectly decoded 
information is orthogonal to the encoded information, and its probability is 0.1563, an improvement over 
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the one-qubit error-probability of 0.25. The second state behaves quite differently: 



_L(|o) + |i)) -L(|000> + |111» 



decode 



.252*. 75: ^(|iio) + |ooi)) 



.0469: ^|ii>-(|i> + |o», (14) 



Not all error events have been shown, but in each case it can be seen that the state is decoded correctly, so 
the error is 0. This shows that the error probability can depend significantly on the initial state. To remove 
this dependence and give a state independent error quantity, one can use the "worst-case", the "average" 
or the "entanglement" error. See Sect. 4.2. 

3.3 Quantum Code for a Cyclic System 

The shift operators introduced earlier act as permutations of the seven states of the cyclic system. They 
can therefore be extended to unitary operators on a seven-state cyclic quantum system with logical basis 
|0), |1), |2), |3), |4), |5), |6). The error model introduced earlier makes sense here without modification, 
as does the encoding. The subsystem identification now takes the six-dimensional subspace spanned by 
|0), . . . |5) to a pair consisting of a three-state system with basis | — 1), |0), |1) and a qubit. The identifica- 
tion of Eq. 6 extends linearly to a unitary subsystem identification. The procedure for decoding is modified 
as follows: First, a measurement is performed to determine whether the state is in the six-dimensional sub- 
space or not. If it is, the identification is used to extract the qubit. Here is an outline of what happens when 
the state (|o) + |i)) is encoded: 

_L(|o> + |,>, ^m + w) 

MMle-*-. ^(|3> + |6)), 



detect 



.001 : 



.5 : fail 
.5: |3> 
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decode 
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fail 



I l>-(|(|o> + |i» + |(-|o) + |i))) 



(15) 



A "good" state was separated from the output in the case that is shown. The leftover error term has 
probability amplitude .0005* ((1/2)^ + (1/2)^) = .00025, which contributes to the total error (not shown). 



3.4 Three Quantum Spin-1/ 2 Particles 

Quantum physics provides a rich source of systems with many opportunities for representing and protect- 
ing quantum information. Sometimes it is possible to encode information in such a way that it is protected 
from the errors indefinitely, without intervention. An example is the trivial two-qubit system discussed 
before. Whenever error protection without intervention is possible, there is an information-carrying sub- 
system such that errors act only on the associated syndrome subsystem regardless of the current state. An 
information-carrying subsystem with this property is called "noiseless". A physically motivated exam- 
ple of a one-qubit noiseless subsystem can be found in three spin-| particles with errors due to random 
fluctuations in an external field. 

A spin-| particle's state space is spanned by two states ||) and ||). Intuitively, these states correspond 
to the spin pointing "up" (|T)) or "down" (|i)) in some chosen reference frame. The state space is therefore 
the same as that of a qubit and we can make the identifications |t) ^ |o) and ||) ^ An external 
field causes the spin to "rotate" according to an evolution of the form 



-i{uxCrx+Uyay+UzCrz)t/2 



(16) 



The vector u = {ux, Uy, Uz) characterizes the direction of the field and the strength of the spin's interaction 
with the field. This situation arises, for example, in nuclear magnetic resonance with spin-| nuclei, where 
the fields are magnetic fields (see [17]). 

Now consider the physical system composed of three spin-| particles with errors acting as identical 
rotations of the three particles. Such errors occur if they are due to a uniform external field that fluctuates 
randomly in direction and strength. The evolution caused by a uniform field is given by 



-i{u^ (a, (1) (2) 0) )+Uy {ay W +ay (2) +^7^,(3) )+u, {a, (D (2) +<t, (3) ))t/2 



-i{UxJx+UyJy+UzJz)t 



(17) 



with Ju = (cr„(-^) + (Tu^^) + cr„^^^) /2 for u = x,y and z. We can exhibit the error operators arising from 
a uniform field in a compact form by defining J = (J^,, Jy, J^) and v = {ux,Uy,Uz)t. Then the error 
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operators are given by E{v) = e~™ "^, where the dot product in the exponent is calculated like the standard 
vector dot product. 

For a one-qubit noiseless subsystem, the key property of the error model is that the errors are symmetric 
under any permutation of the three particles. A permutation of the particles acts on the particles' state 
space by permuting the labels in the logical states. For example, the permutation vr that swaps the first two 
particles acts on logical states as 

^\a)M\^)s = HHH = \WM- (18) 

To say that the errors are symmetric under particle permutations means that each error E satisfies n^^En = 
E, or equivalently Eir = -nE (E "commutes" with tt). To see that this condition is satisfied, write 



71 ^E(v)'7l 



71 e 71 

^ — ilT~^{v-J)TT 



If TT permutes particle a to b, then 7Y^^a.a^^^n = a^S^^. It follows that tt^^Jtt = J. This expression 
shows that the errors commute with the particle permutations and therefore cannot distinguish between 
the particles. An error model satisfying this property is called a "collective" error model. 

If a noiseless subsystem exists, then it suffices to learn the symmetries of the error model to construct 
the subsystem. This procedure is explained in Sect. 6.2. For the three spin-| system, the procedure results 
in a one-qubit noiseless subsystem protected from all collective errors. We first exhibit the subsystem 
identification and then discuss its properties to explain why it is noiseless. As in the case of the seven- 
state cyclic system, the identification involves a proper subspace of the physical system's state space. The 
subsystem identification involves a four-dimensional subspace and is defined by the following correspon- 
dence: 



M\i\mX + e-^''^/'IT>Ji>JT>3 + e^^-/3|T>JT>Ji>3] - |T>-|o> 

;)5(li>JT>JT>3 + e^^'^/3|T>Ji>JT>3 + e-W3|T>JT>Ji>3j - |T>-|i> 

-;^(lT>Ji>Ji>3 + e-2'^/3|i>jT>Ji>3 + e^^-/3|i>J4|T>3) - li> |o> 

-^(|T>J4I4 + e^''^/'li>JT>Ji>3 + e-^'^/'IOJOJU) - li>-|i> 



(20) 



The state labels for the syndrome subsystem (before the dot in the expressions on the right side) identify 
it as a spin- 1 subsystem. In particular, it responds to the errors caused by uniform fields in the same 
way as the physical spin-i particles. This behavior is caused by 2 J„ acting as the n-Pauli operator on the 
syndrome subsystem. To confirm this property, we apply 2 J„ to the logical states of Eq. 20 for u = z, x. 
The property foru = y then follows because iay = azO-^. Consider 2J^. Each of the four states shown in 
Eq. 20 is an eigenstate of 2 J^. For example, the physical state for ||) ■ |o) is a superposition of states with 
two spins up (I) and one spin down (|). The eigenvalue of such a state with respect to 2.1^ is the difference 
A between the number of spins that are up and down. Thus, 2J^||) ■ |o) = |t) ■ |o). The difference is also 
A = 1 for It) ■ |i) and A = 1 for ||) ■ |o) and ||) ■ |i). Therefore, 2 acts as the 2;-Pauli operator on 



20 



the syndrome subsystem. To confirm this behavior for 2 J^., we compute 2 Ja;||) ■ |o). 



2J.|T> |0> = 2J.^(^|i>jr>jT>3+e-^2'^/3|r>Ji>jT>3 + e^^-/3|T>jr>J!>3 

+ e'^^/'^K^^) +^.^'^ +^.('^)iT>jnii>3 

TsMUIUlU + li>JUIT>3 + li>JT>Ji>3 
+e-''-^'j=Mi\\iX\1X + IT>JT>JT>3 + IT>Ji>Ji>3 
+ e^'^/'73(li>JT>Ji>3 + IT>JUI4 + IT>JT>JT>3 




e 



-j27r/3 _|_ pj27r/3 



+ e^^-/4|i)jT>Ji>3 
+ e-2-/=^ )|i>Ji>jT>3 

-75(lT>Ji>Ji>3 + e-''^/'IUIT>Ji>3 + e^'^/'li>Ji>Jt>3) 
= |i>|o>. (21) 

Similarly, one can check that, for the other logical states, the effect of 2 J^. is to flip the orientation of the 
syndrome spin. The fact that the subsystem identified in Eq. 20 is noiseless now follows from the fact that 
the errors E{v) are exponentials of sums of the syndrome spin operators J^. The errors therefore act as 
the identity on the information-carrying subsystem. 

The noiseless qubit supported by three spin-| particles with collective errors is another example in 
which the subsystem identification does not involve the whole state space of the system. In this case, the 
errors of the error model cannot remove amplitude from the subspace. As a result, if we detect an error, 
that is, if we find that the system's state is in the orthogonal complement of the subspace of the subsystem 
identification, we can deduce that either the error model is inadequate, or we introduced errors in the 
manipulations required for transferring information to the noiseless qubit. 

The noiseless subsystem of three spin-| particles can be physically motivated by an analysis of quan- 
tum spin numbers. The physical motivation is outlined in Fig. 8. 
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Noiseless qubit 
Spin 1/2 



Spin 3/2 



FIG. 8: One noiseless qubit from three spin-| particles. The left side shows the three particles, with errors 
caused by fluctuations in a uniform magnetic field depicted by a noisy coil. The spin along direction u 
{u = X, y, z) can be measured and its expectation is given by (^/'l J^ulV^), where is the quantum state 
of the particles and J„ is the total spin observable along the n-axis given by the half-sum of the n-Pauli 
matrices of the particles as defined in the text. The squared magnitude of the total spin is given by the 
expectation of the observable = J ■ J = + Jy + J^. The observable commutes with the 

Ju and therefore also with the errors E{v) = e"*^' caused by uniform field fluctuations. This can be 
verified directly, or one can note that E(v) acts on J as a rotation in three dimensions, and as one would 
expect, such rotations preserve the "squared length" of J. It now follows that the eigenspaces of 
are invariant under the errors, and therefore that the eigenspaces are good places to look for noiseless 
subsystems. The eigenvalues of are of the form j{j + 1), where j is the spin quantum number of the 
corresponding eigenspace. There are two eigenspaces, one with spin j = | and the other with spin j = §• 
The figure shows a thought experiment that involves passing the three-particle system through a type of 
beam splitter (BS) or Stem-Gerlach apparatus sensitive to J^. Using such a beam splitter, the system of 
particles can be made to go in one of two directions depending on j. In the figure, if the system's state is 
in the the spin-| subspace, it passes through the beam splitter; if it is in the spin-i subspace, the system 
is reflected up. It can be shown that the subspace with j = | is four-dimensional and spanned by the 
states that are symmetric under particle permutations. Unfortunately, there is no noiseless subsystem in 
this subspace (see Sect. 6.2). The spin-i subspace is also four dimensional and spanned by the states 
in Eq. 20. The spin-i property of the subspace implies that the spin operators Ju act in a way that is 
algebraically identical to the way o'u/2 acts on a single spin-^ particle. This property implies the existence 
of the syndrome subsystem introduced in the text. Conventionally, the spin-^ subspace is thought of as 
consisting of two orthogonal two-dimensional subspaces each behaving like a spin-| with respect to the 
Ju- This choice of subspaces is not unique, but by associating them with two logical states of a noiseless 
qubit, one can obtain the subsystem identification of Eq.20. Some care needs to be taken to ensure that the 
noiseless qubit operators commute with the J„, as they should (see Sect. 6.2). In the thought experiment, 
one can imagine unitarily rotating the system emerging in the upper path to make explicit the syndrome 
spin- 1 subsystem and the noiseless qubit with which it must be paired. The result of this rotation is shown. 
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4 Error Models 



We have seen several models of physical systems and errors in the examples of the previous sections. Most 
physical systems under consideration for QIP consist of particles or degrees of freedom that are spatially 
localized, a feature reflected in the error models that are usually investigated. Because we also expect 
the physically realized qubits to be localized, the standard error models deal with quantum errors that act 
independently on different qubits. Logically realized qubits, such as those implemented by subsystems 
different from the physically obvious ones, may have more complicated residual error behaviors. 

4.1 The Standard Error Models for Qubits 

The most investigated error model for qubits consists of "independent, depolarizing errors". This model 
has the effect of completely depolarizing each qubit independently with probability p (see Eq. 10). For 
one qubit, the model is the least biased in the sense that it is symmetric under rotations. As a result, every 
state of the qubit is equally affected. Independent depolarizing errors are considered to be the quantum 
analogue of the classical independent bit flip error model. 

Depolarizing errors are not typical for physically realized qubits. However, given the ability to con- 
trol individual qubits, it is possible to enforce the depolarizing model (see below). Consequently, error- 
correction methods designed to control depolarizing errors apply to all independent error models. Never- 
theless, it is worth keeping in mind that given detailed knowledge of the physical errors, a special purpose 
method is usually better than one designed for depolarizing errors. We therefore begin by showing how 
one can think about arbitrary error models. 

There are several different ways of describing errors affecting a physical system S of interest. For 
most situations, in particular if the initial state of S is pure, errors can be thought of as being the result 
of coupling to an initially independent environment for some time. Because of this coupling, the effect 
of error can always be represented by the process of adjoining an environment E in some initial state |0)^ 
to the arbitrary state |^)^ of S, followed by a unitary coupling evolution t/^^^^ acting jointly on E and S. 
Symbolically, the process can be written as the map 



Choosing an arbitrary orthonormal basis consisting of the states |e)^ for the state space of the environment, 
the process can be rewritten in the form: 



(22) 



e 




(23) 
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where the last step defines operators Ae*-^^ acting on S by Ag*^^-* = ^(e|t/(^^^|0)^. The expression |e)^/le''' 
is called an "environment labeled operator". The unitarity condition implies that ^e^e = ^ (with sys- 
tem labels omitted). The environment basis |e)^ need not represent any physically meaningful choice of 
basis of a real environment. For the purpose of error analysis, the states |e)^ are formal states that "label" 
the error operators Ae. One can use an expression of the form shown in Eq. 23 even when the |e) are not 
normalized or orthogonal, keeping in mind that as a result, the identity implied by the unitarity condition 
changes. 

Note that the state on the right side of Eq. 23 representing the effect of the errors is correlated with 
the environment. This means that after removing (or "tracing over") the environment, the state of S is 
usually mixed. Instead of introducing an artificial environment, we can also describe the errors by using 
the density operator formalism for mixed states. Define p = |V^^(^| ■ The effect of the errors on the density 
matrix p is given by the transformation 

p^J]A,pAt. (24) 

e 

This is the "operator sum" formalism [18]. 

The two ways of writing the effects of errors can be applied to the depolarizing-error model for one 
qubit. As an environment-labeled operator, depolarization with probability p can be written as 

v^r^|o>,]i + ^(|i>,ii + 14^. + bX'jy + 14^.), (25) 

where we introduced five abstract, orthonormal environment states to label the different events. In this 
case, one can think of the model as applying no error with probability 1 — p, or completely depolariz- 
ing the qubit with probability p. The latter event is represented by applying one of 1, a^, Cy or with 
equal probability p/A. To be able to think of the model as randomly applied Pauli matrices, it is crucial 
that the environment states labeling the different Pauli matrices be orthogonal. The square roots of the 
probabilities appear in the operator because in an environment-labeled operator, it is necessary to give 
quantum amplitudes. Environment labeled operators are useful primarily because of their great flexibility 
and redundancy. 

In the operator sum formalism, depolarization with probability p transforms the input density matrix p 

as 

p 

p {I - p)p + ^{tpt + a^pcr^ + (yypay + a^pa^) 

p 

= {1 - 3p/4:)p + -{a^pa^ + (Typay + cr^pa^) . (26) 

Because the operator sum formalism has less redundancy, it is easier to tell when two error effects are 
equivalent. 

In the remainder of this section, we discuss how one can use active intervention to simplify the error 
model. To realize this simplification, we intentionally randomize the qubit so that the environment cannot 
distinguish between the different "axes" defined by the Pauli spin matrices. Here is a simple randomization 
that actively converts an arbitrary error model for a qubit into one that consists of randomly applying Pauli 
operators according to some distribution. The distribution is not necessarily uniform so the new error 
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model is not yet depolarizing. Before the errors act, apply a random Pauli operator au (u = 0, x, y, z, 
ctq = After the errors act apply the inverse of that operator, = a^, then "forget" which operator was 
applied. This randomization method is called "twirling" [19]. To understand twirling, we use environment 
labeled operators to demonstrate some of the techniques useful in this context. The sequence of actions 
implementing twirling can be written as follows (omitting labels for S): 

li') ~^ \ Yl,u \^X^u\i') ^PPly ^ random cr„, remembering u with the help of the system C. 

^ Y.e\^X\T.u\'^X^e(Tu\i^) crrors act. 

^ Ee \A\T.u \u\(yuAe(Tu\^) apply CT„ = 

— > Eeu 1*^^) ^'^uAeCTul^) forgct wMch M was uscd by absorbing its mcmory in E. 

(27) 

The system C that was artificially introduced to carry the memory of u may be a classical memory because 
there is no need for coherence between different \u)^. 

To determine the equivalent random Pauli operator error model, it is necessary to rewrite the total 
effect of the procedure using an environment labeled sum involving orthogonal environment states and 
Pauli operators. To do so, express as a sum of the Pauli operators, = aevO'v, using the fact that 
the are a linear basis for the space of one-qubit operators. Recall the fact that anticommutes with 
c^^, if 7^ n 7^ i; 7^ 0. Thus auO-^,au = (— l)^^'"^o"i,, where {v, u) = 1 if ^ u ^ v ^ and {v, u) = 
otherwise. We can now rewrite the last expression of Eq. 27 as follows: 

eu eu V 

It can be checked that the states \ Eu(~l)^^'"^ k^)^^ orthonormal for different e and v. As a result the 
states |aeii( — 1)^"'"^ \^^Xc orthogonal for different v and have probability (square norm) given by 
Pv = Ee I'^ejjP- Introducing ^/^\,\vX^ = Yl,eu |Q^eD(— we can write the sumof Eq. 28 as 

E (E^"-(-1)^''"^IHc ) = E v^l^>Bc^-l^>' (29) 

V \ eu / V 

showing that the twirled error model behaves like randomly applied Pauli matrices with applied with 
probability p^. It is a recommended exercise to reproduce the above argument using the operator sum 
formalism. 

To obtain the standard depolarizing error model with equal probabilities for the Pauli matrices, it 
is necessary to strengthen the randomization procedure by applying a random member U of the group 
generated by the 90° rotations around the x, y and z axes before the error and then undoing U by applying 

Randomization can be used to transform any one-qubit error model into the depolarizing error model. 
This explains why the depolarizing model is so useful for analyzing error correction techniques in sit- 
uations in which errors act independently on different qubits. However, in many physical situations, the 
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independence assumptions are not satisfied. For example, errors from common internal couplings between 
qubits are generally pairwise correlated to first order. In addition, the operations required to manipulate 
the qubits and to control the encoded information act on pairs at a time, which tends to spread even single 
qubit errors. Still, in all these cases, the primary error processes are local. This means that there usually 
exists an environment labeled sum expression for the total error process in which the amplitudes associ- 
ated with errors acting simultaneously at k locations in time and space decrease exponentially with k. In 
such cases, error-correction methods that handle all or most errors involving sufficiently few qubits are 
still applicable. 



4.2 Quantum Error Analysis 

One of the most important consequences of the subsystems interpretation of encoding quantum informa- 
tion in a physical system is that the encoded quantum information can be error-free even though errors 
have severely changed the state of the physical system. Almost trivially, any error operator acting only 
on the syndrome subsystem has no effect on the quantum information. The goal of error correction is 
to actively intervene and maintain the syndrome subsystem in states where the dominant error operators 
continue to have little effect on the information of interest. An important issue in analyzing error correc- 
tion methods is to estimate the residual error in the encoded information. A simple example of how that 
can be done was discussed for the quantum repetition code. The same ideas can be applied in general. 
Let S be the physical system in which the information is encoded and \ tp)^ an initial state containing such 
information with the syndrome subsystem appropriately prepared. Errors and error-correcting operations 
modify the state. The new state can be expressed using environment labeling as |e)^ Ae*-^^ In view 
of the partitioning into information-carrying and syndrome subsystems, "good" states |e)^ are those states 

for which Ae^^^ acts only on the syndrome subsystem given that the syndrome has been prepared. The 
remaining states |e) form the set of "bad" states, B. The error probability p^. can be bounded from above 
by 



Pe < 



\e£B / 



(30) 



where \A\i = max^ the maximum being taken over normalized states. The second inequality 

usually leads to a gross overestimate but is independent of the encoded information and often suffices for 
obtaining good results. Because the environment-labeled sum is not unique, a goal of the representation 
of the errors acting on the system is to use "good" operators to the largest extent possible. The flexibility 
of these error-expansions makes them very useful for analyzing error models in conjunction with error- 
correction methods. 

In principle, we can obtain better expressions for pe by calculating the density matrix p of the state of 
the subsystem containing the desired quantum information. This calculation involves "tracing over" the 
syndrome subsystem. The matrix p can then be compared to the intended state. If the intended state is pure. 



27 



given by the probability of error is given by 1 — (0|p|0), which is the probability that a measurement 
that distinguishes between |0) and its orthogonal complement fails to detect The quantity is 
called the "fidelity" of the state p. 

For applications to communication, the goal is to be able to reliably transmit arbitrary states through 
a communication channel, which may be physical or realized via an encoding/decoding scheme. It is 
therefore important to characterize the reliability of the channel independent of the information transmit- 
ted. Eq. 30 can be used to obtain state-independent bounds on the error probability but does not readily 
provide a single measure of reliability. One way to quantify the reliability is to identify the error of the 
channel with the average error over all possible input states. The reliability is then given by the average 
fidelity 1 — Another elegant way appropriate for QIP is to use the "entanglement fidelity" [20]. Entan- 
glement fidelity measures the error when the input is maximally entangled with an identical "reference" 
system. In this process, the reference system is imagined to be untouched, so that the state of the reference 
system together with the output state can be compared to the original entangled state. For a one-qubit 
channel labeled S, the reference system is a qubit, which we label with R. An initial, maximally entangled 
state is 

|5> = ^(KK + |i>Ji>3). (31) 

The reference qubit is assumed to be perfectly isolated and not affected by any errors. The final state p*^'^^) 
is compared to \B), which gives the entanglement fidelity according to the formula /g = {B\p'-^^^B). 
The entanglement error is eg = 1 — /e. It turns out that this definition does not depend on the choice of 
maximally entangled state. Fortunately, the entanglement error and the average error e„ are related by a 
linear expression: 

ea = gfe- (32) 

For fc-qubit channels, the constant | is replaced by 2^ /{2'^ + 1). Experimental measurements of these 
fidelities do not require the reference system. There are simple averaging formulas to express them in 
terms of the fidelities for transmitting each of a sufficiently large set of pure states. An example of the 
experimental determination of the entanglement fidelity when the channel is realized by error-correction 
is provided in [21]. 



5 From Quantum Error Detection to Error Correction 

In the independent depolarizing error model with small probability p of depolarization, the most likely 
errors are those that affect a small number of qubits. That is, if we define the "weight" of a product of 
Pauli operators to be the number of qubits affected, the dominant errors are those of small weight. Because 
the probability of a non-identity Pauli operator is 3j9/4 (see Eq. 25), one expects about of n qubits 
to be changed. As a result, good error-correcting codes are considered to be those for which all errors 
of weight < e ^ can be corrected. It is desirable that e have a high "rate", which means that it is a 
large fraction of the total number of qubits, n (the "length" of the code). Combinatorially, good codes are 
characterized by a high minimum distance, a concept that arises naturally in the context of error-detection. 
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5.1 Quantum Error Detection 

Let C be a quantum code, that is, a subspace of the state space of a quantum system. Let P be the operator 
that projects onto C, and P-*- = 1 — P the one that projects onto the orthogonal complement. Then the 
pair P, P-*- is associated with a measurement that can be used to determine whether a state is in the code 
or not. If the given state is \^), then the result of the measurement is P\ tp) with probability |P|?/;) p and 
P-*"!^) otherwise. As in the classical case, an error-detection scheme consists of preparing the desired 
state E C, transmitting it through, say, a quantum channel, then measuring whether the state is still 
in the code, accepting the state if it is, and rejecting it otherwise. We say that C detects error operator 
E if states accepted after E had acted are unchanged except for an overall scale. Using the projection 
operators, this is the statement that for every state \^i) E C, PE\%l)i) = XeI'iPi)- Because P\^) is in the 
code for every {ip), it follows that PEP\ip) = A^P]^). Therefore, a characterization of detectability is 
gven by: 

Theorem. E is detectable by C if and only if PEP = XeP for some A^. (33) 

It is not difficult to see that a second characterization is given by: 

Theorem. E is detectable by C if and only if for all |0) E C, {^\E\(I)) = \e{^\(P) 

for some A^;. (34) 

A third characterization, which we state without proof, is obtained by taking the condition for classical 
detectability in Thm. 7 and replacing "7^" by "orthogonal to": 

Theorem. E is detectable by C if and only if for all |0), \^) in the code with |0) orthog- 
onal to {ijj), E\(j)) is orthogonal to \^). (35) 

For a given code C, the set of detectable errors is closed under linear combinations. That is, if Ei and 
E2 are both detectable, then so is aEi + aE2- This useful property implies that to check detectability, one 
has to consider only the elements of a linear basis for the space of errors of interest. 

Consider n qubits with independent depolarizing errors. A robust error-detecting code should detect 
as many of the small weight errors as possible. This requirement motivates the definition of "minimum 
distance": The code C has minimum distance d if the smallest-weight product of Pauli operators E for 
which C does not detect E is d. The notion comes from classical codes for bits, where a set of code words 
C has minimum distance d if the smallest number of flips required to change one code word in C into 
another one in C is d. For example, the repetition code for three bits has minimum distance 3. Note 
that the minimum distance for the quantum repetition code is one: Applying c^'^-^^ preserves the code and 
changes the sign of |iii) but not of |ooo). As a result, cr^^^^ is not detectable. The notion of minimum 
distance can be generalized for error models with specified "first order" error operators [22]. In the case 
of depolarizing errors, the first order error operators are single qubit Pauli matrices, which are the errors 
of weight one. 

5.2 Quantum Error Correction 

Let 8 = {Eq = 1, Pi, . . .} be the set of errors that we wish to be able to correct. When is there a decoding 
procedure for the code C such that all errors in S are corrected? When such a decoding procedure exists. 
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we say that S is "correctable" (by C). A situation in which correctability of £ is apparent occurs when the 
errors Ei are unitary operators satisfying the condition that EiC are mutually orthogonal subspaces. The 
repetition code has this property for the set of errors consisting of the identity and Pauli operators acting 
on a single qubit. In this situation, the procedure for decoding is to first make a projective measurement to 
determine which of the subspaces EiC the state is in, and then to apply the inverse of the error operator, 
E\. This situation is not far from the generic one. One characterization of correctability is in the following 
theorem: 

Theorem. 8 is correctable if and only if there is a linear transformation of the set 8 
such that the operators E'.^ in the new set satisfy the following properties: (1) The E[C are 
mutually orthogonal, and (2) E[ restricted to C is proportional to a restriction to C of a 
unitary operator. (36) 

To relate this characterization to detectability, note that the two properties imply that (E'^'^ E'^C is orthog- 
onal to C if ? 7^ j, and {E'^)'^E[ restricted to C is proportional to the identity on C. In other words, the 
(E'^y E'j are detectable. This detectability condition applied to the original error set constitutes a second 
characterization of correctability, given in the next theorem: 

Theorem. 8 is correctable if and only if the operators in the set = {E\E2 : E^ e 8} 

are detectable. (37) 

Before explaining the characterizations of correctability, we consider the situation of n qubits, where 
the characterization by detectability (37) leads to a useful relationship between minimum distance and 
correctability of low weight errors: 

Theorem. If a code on n qubits has a minimum distance of at least 2e + 1, then the set of 

errors of weight at most e is correctable. (38) 

This theorem follows by observing that the weight of E\E2 is at most the sum of the weights of the E^. 
As a result of this observation, the problem of finding good ways of correcting all errors up to a maximum 
weight reduces to that of constructing codes with sufficiently high minimum distance. Thus questions 
such as "what is the maximum dimension of a code of minimum distance d on n qubits?" are of great 
interest. As in the case of classical coding theory this problem appears to be very difficult in general. 
Answers are known for small n [6] and there are asymptotic bounds [23]. Of course, for achieving low 
error probabilities, it is not necessary to correct all errors of weight < e, just almost all such errors. 
For example, the concatenated codes used for fault-tolerant quantum computation achieve this goal (see 
Sec. 7). 

For the remainder of this section we explain the characterizations of correctability. Using the condi- 
tions for detectability from the previous section, the condition for correctability in Thm. 37 is equivalent 
to 



This condition is preserved under a linear change of basis for 8. That is, if A is any invertible matrix with 
coefficients Uij, we can define new error operators = J2i -^i^ifc- For the Df., the left side of Eq. 39 is 
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is, for all x, x^Kx > and = A), we can choose A such that A^^AA is of the form ( ^ ^ ) . In this 



— CLikttjlXijP 
ij 

= {A'^A)^^P, (40) 
where A is the matrix formed from the Ay . Using the fact that A is a positive semidefinite matrix (that 

T rVinncf A ciirVi fVinf 4tA A ic nf tVif form I 

X 

matrix, the upper left block is the identity operator for some dimension. 

An important consequence of invariance under a change of basis of error operators is that the set of 
errors correctable by a particular code and decoding procedure is linearly closed. Thus, if E and D are 
corrected by the decoding procedure, then so is aE + (3D. This observation also follows from the linearity 
of quantum mechanically implementable operations. 

We explain the condition for correctability by using the subsystems interpretation of decoding proce- 
dures. For simplicity, assume that 1 E S. To show that correctability of E implies detectability of all 
E E £^£, suppose that we have a decoding procedure that recovers the information encoded in C after 
any of the errors in £ have occurred. Every physically realizable decoding procedure can be implemented 
by first adding "ancilla" quantum systems in a prepared pure state to form a total system labeled T, then 
applying a unitary map U to the state of T, and finally separating T into a pair of systems S, Q, where 
S corresponds to the syndrome subsystem, and Q is a quantum system with the same dimension as the 
code that carries the quantum information after decoding. Denote the state space of the physical system 
containing C as H, and the state space of system X by Hx, where X is any one of the other systems. Let 
V be the unitary operator that encodes information by mapping Hq onto C C H. We have the following 
relationships: 

Hq ^ C CHCHt ^Hs^Hq. (41) 

Here, we used bidirectional arrows to emphasize that the operators V and U can be inverted on 
their range and therefore identify the states in their domains with the states in their ranges. The inclusion 
H C Hj implicitly identifies H with the subspace determined by the prepared pure state on the ancillas. 
The last state space of Eq. 41 is expressed as a tensor product C'®"), which is the state space of the 
combined system SQ. For states of Hq we write = ^ {tp)^ E C. Because 11 is a correctable 

error, it must be the case that |^^)^ ^ lO)^!^^) E Hs ^ Hq for some state |0)^ of the syndrome subsystem. 
To establish this fact, use linearity of the maps. In general: 

^ \^\\^) (42) 

The \i) need not be normalized or orthogonal. Let F be the subspace spanned by the \i) . Then U induces 
an identification of F ® Hq with a subspace C CH. This is the desired subsystem identification. We can 
then see how the errors act in this identification: 

i (43) 
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This means that for all |^) and 



\ij\E]E,\(t)X=%3\T)^{ij\cP), (44) 

that is, all errors in ^^^f^ are detectable. 

Now, suppose that all errors in S'^S are detectable. To see that this implies correctability of 8, choose 
a basis for the errors so that Ajj = 5ij\i with Aj = 1 for z < s and A^ = otherwise. Define a subsystem 
identification by 

W)^E,\4^\, (45) 

for < z < s. By assumption and construction, = 5ij, which implies that W is unitary 

(after linear extension), and so this is a proper identification. For i > s, Ei\il))^ = 0, which implies that for 
states in the code, these errors have probability 0. Therefore, the identification can be used to successfully 
correct £. 



6 Constructing Codes 
6.1 Stabilizer Codes 

Most useful quantum codes are based on "stabilizer" constructions [4, 5]. Stabilizer codes are useful 
because they make it easy to determine which Pauli-product errors are detectable and because they can 
be interpreted as special types of classical "linear" codes. The latter feature makes it possible to use 
well-established techniques from the theory of classical error-correcting codes to construct good quantum 
codes. 

A stabilizer code of length n for k qubits (abbreviated as an "[[n, k]] code"), is a 2^^ -dimensional 
subspace of the state space of n qubits that is characterized by the set of products of Pauli operators that 
leave each state in the code invariant. Such Pauli operators are said to "stabilize" the code. A simple 
example of a stabilizer code is the quantum repetition code introduced in Sec. 3.2. The code's states 
a|ooo) + are exactly the states that are unchanged after applying cr^^'^'^az^'^'' or a^^^^V^^^^ 

To simplify the notation, we write I = 1, X = ax,Y = ay, and Z = a^. A product of Pauli operators 
can then be written as ZIXI = az^'^'^cr-x^^^ (as an example of length 4) with the ordering determining which 
qubit is being acted upon by the operators in the product. 

We can understand the properties of stabilizer codes by working out the example of the quantum 
repetition code with the stabilizer formalism. A stabilizer of the code is 5" = {ZZI, ZIZ}. Let S be the 
set of Pauli products that are expressible up to a phase as products of elements of S. For the repetition 
code, S = {III, ZZI, ZIZ, IZZ}. S consists of all Pauli products that stabilize the code. The crucial 
property of S is that its operators commute, that is, for A, B E S, AB = BA. According to results 
from linear algebra, it follows that the state space Ti. can be decomposed into orthogonal subspaces Hx 
such that for A e 5 and {tp) E Hx, A\ilj) = \{A)\tp). The Hx are the common eigenspaces of S. The 
stabilizer code C defined by S is the subspace stabilized by the operators in S, which means that it is given 
by Hx with X{A) = 1. The subspaces for other \{A) have equivalent properties and are often included 
in the set of stabilizer codes. For the repetition code, the stabilized subspace is spanned by the logical 
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basis \ooo) and From the point of view of stabilizers, there are two ways in which a Pauli product 

B can be detectable: (1) If i? G S, because in this case B acts as the identity on the code, and (2) if 
B anticommutes with at least one member (say A) of S. To see the second way, let \^) be in the code. 
Then A iB\ip)) = (AB) = - (BA) \^) = -B = -B\^). Thus B\ij) belongs to Hx with 

\{A) = — 1. Because this subspace is orthogonal to C = Hi, B is, detectable. We define the set of Pauli 
products that commute with all members of S as S^. Thus, B is detectable if either B ^ or B E S. 
Note that because S consists of commuting operators, S C S-^. 

To construct a stabilizer code that can correct all errors of weight at most one (a "quantum one-error- 
correcting code"), it suffices to find S with the minimum weight of non-identity members of being 
at least three (3 = 2-1 + 1, see Thm. 38). In this case we say that has minimum distance three. 
As an example, we can exhibit a stabilizer for the famous length-five one-error-correcting code for one 
qubit[19, 24]: 

S = {XZZXI, IXZZX, XIXZZ, ZXIXZ}. (46) 

As a general rule, it is desirable to exhibit the stabilizer minimally, which means that no member is the 
product up to a phase of some of the other members. In this case, the number of qubits encoded is n — |S'|, 
where n is the length of the code and l^l is the number of elements of S. 

The correspondence between stabilizer codes and classical binary codes is obtained by replacing the 
symbols /, X, Y and Z in a Pauli product by 00, 01, 10 and 11, respectively. Thus, the members of the 
stabilizer can be thought of as binary vectors of length 2n. We use arithmetic modulo two for sums, inner 
products and application of a binary matrix. Because the numbers modulo two (Z2) form a mathematical 
"field", the basic properties of vectors spaces and linear algebra apply to binary vectors and matrices. Thus, 
the stabilizer is minimal in the sense introduced above if the corresponding binary vectors are independent 
over Z2. Given two binary (column) vectors x and y of length two associated with Pauli products, the 
property of anticommuting is equivalent to By = 1, where B is the block diagonal 2n x 2n matrix 

with 2x2 blocks given by ^ ^ q ^ . This means that S*"*" can be identified with the set of vectors x 

such that x^ By = for all binary vectors y associated with the members of S. It turns out that the 
inner product {x, y) = x^By arises in the study of classical codes over the four-element mathematical 
field GF(4:), which can be represented by the vectors 00, 01, 10 and 11 with addition modulo 2 and a new 
multiplication operation. This relationship leads to the construction of many good stabilizer codes [6]. 

6.2 Conserved quantities, symmetries and noiseless subsystems. 

Even though a physical system may be exposed to error, some of its properties are often not affected by 
the errors. If these "conserved quantities" can be identified with the defining quantities of qubits or other 
information units, error-free storage of information can be ensured without active intervention. This is the 
idea behind noiseless subsystems. 

When do noiseless subsystems exist and how can they be constructed? The examples discussed in 
the previous sections show that a noiseless subsystem may be a subset of physical qubits, as in the trivial 
two-qubit example, or it may require a more abstract subsystem identification, as in the example of the 
three spin-| particles. As will be explained, in both cases, there are quantities conserved by the errors that 
can be used to identify the noiseless subsystem. 
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A simple classical example for the use of conserved quantities consists of two physical bits subject 
to errors that either flip both bits or leave them alone. A quantity invariant under this noise model is the 
parity P{s) of a state s of the two bits. The parity P(s) is defined as the number of I's in the bitstring s 
reduced modulo 2: P(oo) = -P(ii) = and -P(oi) = -P(io) = 1. Flipping both bits does not change 
the value of P. Consequently, the two values of P can be used to identify the two states of a noiseless bit. 
The syndrome subsystem can be associated with the value (nonconserved) of the first physical bit using 
the function defined by F{ob) = 0, F{ib) = 1. The corresponding subsystem identification is obtained 
by using the values of P and F as the states of the syndrome (left) and the noiseless, information-carrying 
subsystem (right) according to ab ^ F{ab) ■ P{ab). 

In quantum systems, conserved quantities are associated with the presence of symmetries, that is, with 
operators that commute with all possible errors. In the trivial two-qubit example, operators acting only on 
qubit 2 commute with the error operators. In particular, if E is any one of the errors, EaJ^'^ = au^^^E, for 
u = x,y, z. It follows that the expectations of cr^'^^^ are conserved. That is, if p is the initial state (density 
matrix) of the two physical qubits and p' is the state after the errors acted, then tr a.J~'^'> p' = tr aj^^p- 
Because the state of qubit 2 is completely characterized by these expectations, it follows immediately that 
it is unaffected by the noise. 

The trivial two-qubit example suggests a general strategy for finding a noiseless qubit: First determine 
the commutant of the errors, which is the set of operators that commute with all errors. Then find a subset 
of the commutant that is algebraically equivalent to the operators characterizing a qubit. The equivalence 
can be formulated as a one-to-one map / from qubit operators to operators in the commutant. For the 
range of / to be algebraically equivalent, / must be linear and satisfy f{A^) = f{Ay and f{AB) = 
f{A)f(B). Once such an equivalence is found, a fundamental theorem from the representation theory of 
finite dimensional operator algebras implies that a subsystem identification for a noiseless qubit exists [22, 
25]. 

The strategy can be applied to the example of three spin-| particles subject to collective errors. One 
can determine the commutant by using the physical properties of spin to find the conserved quantities as- 
sociated with operators in the commutant, as suggested in Fig. 8. Alternatively, observe that by definition, 
this error model is symmetric under permutations of the particles. Therefore, the actions of these permu- 
tations on the state space form a group 11 of unitary operators commuting with the errors. It is a fact that 
the commutant of the set of collective errors consists of the linear combinations of operators in 11. With 
respect to the group 11, one can immediately determine the space V3/2 of symmetric states, that is, those 
that are invariant under the permutations. It is spanned by 

ITTT), ^(lTTi> + ITiT> + liTT>), ^(iTii) + liti) + liiT)), liii>- (47) 

A basic result from the representation theory of groups implies that the projection onto V3/2 is given by 
P3/2 = I J2gen9- "^^^ orthogonal complement V1/2 of V3/2 is invariant under 11 and can be analyzed 
separately. With the subsystem identification of Eq. 20 already in hand, one can see that the permutation 
TTi which permutes the spins according tol^2^3^1 acts on the noiseless qubit by applying 
^240° = e"*'^^^'^/^, a 240° rotation around the 2;-axis. Similarly, the permutation 112 which exchanges the 
last two spins acts as a,j. on the qubit. To make them algebraically equivalent to the corresponding qubit 
operators, it is necessary to eliminate their action on V3/2 by projecting onto Vi/2'- vr^ = (1 — P3/2)7ri and 
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= (1 — P3/2)7r2. Sums of products of tx[ and vTg are equivalent to the corresponding sums of products 
of 2^240° and ax, which generate all qubit operators. To get the subsystem identification of Eq. 20, one 
can start with a common eigenstate |^) of tx[ (a 2; -rotation on the noiseless qubit) and 2 (the syndrome 
subsystem's a^) with eigenvalues e^*^'^/'^ and 1, respectively. The choice of eigenvalues implies that 
1^) ^ IT) ■ |o) in the desired identification. The other logical states of the syndrome spin-| and the 
noiseless qubit can be obtained by applying 1X2, 2Jx and tx'^J^ to which act by flipping the states 
of the qubit or the syndrome spin. This method for obtaining the subsystem identification generalizes to 
other operator equivalences and error operators. 

7 Fault Tolerant Quantum Communication and Computation 

The utility of information and information processing depends on the the ability to implement large num- 
bers of information units and information processing operations. We say that an implementation of in- 
formation processing is scalable if the implementation can realize arbitrarily many information units and 
operations without loss of accuracy and with physical resource overheads that are polynomial (or "effi- 
cient") in the number of information units and operations. Scalable information processing is achieved by 
implementing information fault tolerantly. 

One of the most important results of the work in quantum error-correction and fault-tolerant computa- 
tion is the accuracy threshold theorem, according to which scalability is possible, in principle, for quantum 
information. 

Theorem. Assume the requirements for scalable QIP (see below). If the error per gate 
is less than a threshold, then it is possible to efficiently quantum compute arbitrarily accu- 
rately. (48) 

7.1 Requirements for Scalable QIP 

The value of the threshold accuracy (or error) depends strongly on which set of requirements is used, in 
particular, the error model that is assumed. The requirements are closely related to the basic requirements 
for constructing a quantum information processor [26] but have to include explicit assumptions on the 
error model and on the temporal and spatial aspects of the available quantum control. 
Scalable physical systems: It is necessary to have access to physical systems that are able to support 
qubits or other basic units of quantum information. The systems must be scalable, that is, they must be 
able to support any number of independent qubits. 

State preparation: One must be able to prepare any qubit in the standard initial state |o) . Any preexisting 
content is assumed to be lost, as would happen if, for example, the qubit is first discarded and then replaced 
by a prepared one. The condition can be weakened; That is, it is sufficient that a large fraction of the qubits 
can be prepared in this way. 

Measurement: A requirement is the ability to measure any qubit in the logical basis. Again, it is sufficient 
that a sufficiently large fraction of the qubits are measurable. For solving computational problems with 
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deterministic answers, the standard projective measurement can be replaced by weak measurements that 
return a noisy number whose expectation is the probability that a qubit is in the state 1 1) [17]. 
Quantum control: One must have the ability to implement a universal set of unitary quantum gates acting 
on a small number (usually at most two at a time) of qubits. For most accuracy thresholds, it is necessary to 
be able to apply the quantum control in parallel to any number of disjoint pairs of qubits. This parallelism 
requirement can be weakened if a nearly noiseless quantum memory is available. The requirement that 
it be possible to apply two-qubit gates to any pair of qubits is unrealistic given the constraints of three- 
dimensional space. Work on how to deal with this problem is ongoing [11]. The universality assumption 
can be substantially weakened by replacing some or all unitary quantum gates with operations to prepare 
special states or by having additional measurement capabilities. See, for example [27] and the references 
therein. 

Errors: The error probability per gate must be below a threshold and satisfy independence and locality 
properties (see Sec. 4). The definition of "gate" includes the 'no-op", which is the identity operation 
implemented over the time required for a computational step. For the most pessimistic independent, 
local error models, the error threshold is above ~ 10^^. For the independent depolarizing error model, 
it is believed to be better than 10"'' [28]. For some special error models, the threshold is substantially 
higher. For example, for the independent "erasure" error model, where error events are always detected, 
the threshold is above .01, and for an error model whose errors are specific, unintentional measurements 
in the standard basis of a qubit, the threshold is 1 [29, 30]. The threshold is also well above .01 when the 
goal is only to transmit quantum information through noisy quantum channels [31]. 

7.2 Realizing Fault-Tolerance 

The existing proofs of the accuracy threshold theorems consist of explicit instructions for building a scal- 
able quantum information processor and analyses of its robustness against the assumed error model. The 
instructions for realizing scalable computation are based on the following simple idea. Suppose that the 
error rate per operation for some way of realizing qubits is p. We can use these qubits and a quantum 
error-correcting code to encode logical qubits for which the storage error rate is reduced. For example, if 
a one-error correcting code is used, the error rate per storage interval for the logical qubits is expected to 
be < cp^ for some constant c. Suppose that we can show how to implement encoded operations, prepara- 
tions, measurement and the subroutines required for error-correction such that this inequality is now valid 
for each basic encoded step, perhaps for a larger constant C. Suppose furthermore that the errors for the 
encoded information still satisfy the assumed error model. The newly defined logical qubits then have an 
error rate of < Cp^, which is less than p for p < 1/C. We can use the newly realized qubits as a foundation 
for making higher level logical qubits. This results in multiple levels of encodings. In the next level (level 
2), the error rate is < C^p'^, and after k iterations it is < , a doubly-exponentially decreasing 

function of k. This procedure is called "concatenation" (Fig. 9). Because the complexity, particularly 
the number of physical qubits needed for each final logical qubit, grows only singly-exponentially in k, 
the procedure is efficient. Specifically, to achieve a logical error of e per operation requires of the order 
of I log(e)|'' resources per logical qubit for some finite r. In practice, this simple idea is still dauntingly 
complex, but there is hope that for realistic errors in physical systems and by cleverly trading off different 
variations of these techniques, much of the theoretical complexity can be avoided [32]. 
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FIG. 9: Schematic representation of concatenation. The bottom level represents qubits realized more-or- 
less directly in a physical system. Each next level represents logical qubits defined by means of subsystems 
in terms of the previous level's qubits. More efficient subsystems might represent multiple qubits in one 
code block rather than the one qubit per code block shown here. 



Many important developments and ideas of quantum information were ultimately needed to realize 
encoded operations, preparations, measurements and error-correction subroutines that behave well with 
respect to concatenation. Stabilizer codes provide a particularly nice setting for implementing many of 
these techniques. One reason is that good stabilizer codes are readily constructed. Another is that they 
enable encoding operations in a way that avoids spreading errors between the qubits of a single code 
word [14]. In addition, there are many tricks based on teleportation that can be used to maintain the syn- 
drome subsystems in acceptably low-error states and to implement general operations systematically [33]. 
To learn more about all of these techniques, see the textbook by Nielsen and Chuang [34] and the works 
of Gottesman [14] and Preskill [15]. 
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8 Concluding Remarks 

The advancements in quantum error-correction and fault-tolerant QIP have shown that in principle scal- 
able quantum computation is achievable. This is a crucial result because it suggests that experimental 
efforts in QIP will eventually lead to more than a few small scale applications of quantum information to 
communication and problems with few qubits. However, the general techniques for achieving scalability 
that are known are difficult to realize. Existing technologies are far from achieving sufficient accuracy 
even for just two qubits — at least in terms of the demands of the usual accuracy threshold theorems. There 
is hope that more optimistic thresholds can be shown to apply if one considers the specific constraints 
of a physical device, better understands the dominant sources of errors, and exploits tailor-made ways of 
embedding quantum information into subsystems. Current work in this area is focused on finding such 
methods of quantum error control. These methods include approaches to error control not covered in this 
introduction — for example, techniques for actively turning off the error-inducing environmental interac- 
tions [35, 36] and modifications to controlling quantum systems that eliminate systematic and calibration 
errors [37, 38]. Further work is also needed to improve the thresholds for the more pessimistic error 
models and for developing more-efficient scalability schemes. 
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9 Glossary 

Bit. The basic unit of deterministic information. It is a system that can be in one of two possible states, o 
and 1. 

Bit string. A sequence of o's and I's that represents a state of a sequence of bits. Bit strings are words in 
the binary alphabet. 

Classical information. The type of information based on bits and bit strings and more generally on words 
formed from finite alphabets. This is the information used for communication between people. Clas- 
sical information can refer to deterministic or probabilistic information, depending on the context. 

Code. A set of states that can be used to represent information. The set of states needs to have the 
properties of the type of information to be represented. The code is usually a subset of the states of 
a given system Q. It is then a Q-code or a code on Q. If information is represented by a state in the 
code, Q is said to carry the information. 

Code word. A state in a code. The term is primarily used for classical codes defined on bits or systems 
with non-binary alphabets. 

Concatenation. An iterative procedure in which higher-level logical information units are implemented 
in terms of lower-level units. 

Control error. An error due to non-ideal control in applying operations or gates. 

Communication channel. A means for transmitting information from one place to another. It can be 
associated with a physical system in which the information to be transmitted is stored by the sender. 
The system is subsequently conveyed to the receiver, who can then make use of the information. 

Correctable error set. For a given code, a set of errors such that there exists an implementable procedure 
R that, after any one of these errors E acts on a state x in the code, returns the system to the state: 
X = REx. What procedures are implementable depends on the type of information represented by 
the system and, if it is a physical system, its physics. 

Decoding. The process of transferring information from an encoded form to its "natural" form. In the 
context of error correction, decoding is often thought of as consisting of two steps, one which 
removes the errors' effects (sometimes called the recovery procedure) and one that extracts the 
information (often also called decoding, in a narrower sense). 

Depolarizing errors. An error model for qubits in which random Pauli operators are applied indepen- 
dently to each qubit. 

Detectable error. For a given code, an error that has no effect on an initial state in the code if an obser- 
vation determines that the state is still in the code. If the state is no longer in the code, the error is 
said to have been detected and the state no longer represents valid information. 

Deterministic information. The type of information based on bits and bit strings. This is the same as 
classical information but explicitly excludes probabilistic information. 
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Encoding. The process of transferring information from its "natural" form to an encoded form. It requires 
an identification of the valid states associated with the information and the states of a code. The 
process acts on an information unit and replaces it with the system whose state space contains the 
code. 

Environment. In the context of information encoded in a physical system, it refers to other physical 

systems that may interact with the information-carrying system. 
Environmental noise. Noise due to unwanted interactions with the environment. 

Error. Any unintended effect on the state of a system, particularly in storing or otherwise processing 
information. 

Error basis. A set of state transformations that can be used to represent any error. For quantum systems, 
errors can be represented as operators acting on the system's state space, and an error basis is a 
maximal, linearly independent set of such operators. 

Error control. The term for general procedures that limit the effects of errors on information represented 
in noisy, physical systems. 

Error correction. The process of removing the effects of errors on encoded information. 

Error-correcting code. A code with additional properties that enable a decoding procedure to remove 
the effects of the dominant sources of errors on encoded information. Any code is error-correcting 
for some error-model in this sense. To call a code "error-correcting" emphasizes the fact that it was 
designed for this purpose. 

Error model. An explicit description of how and when errors happen in a given system. Typically, a 
model is specified as a probability distribution over error operators. More general models may need 
to be considered, particularly in the context of fault tolerant computation, for which correlations in 
time are important. 

Fault tolerance. A property of encoded information that is being processed with gates. It means that er- 
rors occurring during processing, including control errors and environmental noise, do not seriously 
affect the information of interest. 

Gate. An operation applied to information for the purpose of information processing. 

Hamming distance. The Hamming distance between two binary words (sequences of o and i) is the 
number of positions in which the two words disagree. 

Hilbert space. A ra-dimensional Hilbert space consists of all complex ri-dimensional vectors. A defining 
operation in a Hilbert space is the inner product. If the vectors are thought of as column vectors, 
then the inner product (x, y) of x and y is obtained by forming the conjugate transpose of x and 
calculating (x, y) = x'^y. The inner product induces the usual norm |xp = (x, x). 

Information. Something that can be recorded, communicated and computed with. Information is fun- 
gible, which implies that its meaning can be identified regardless of the particulars of the physical 
realization. Thus, information in one realization (such as ink on a sheet of paper) can be easily 
transferred to another (for example, spoken words). Types of information include deterministic, 
probabilistic and quantum information. Each type is characterized by information units, which are 
abstract systems whose states represent the simplest information of this type. These define the "nat- 
ural" representation of the information. For deterministic information the unit is the bit, whose states 
are symbolized by o and i. Information units can be put together to form larger systems and can be 
processed with basic operations acting on a small number of units at a time. 
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Length. For codes on n basic information units, the length of the code is n. 

Minimum distance. The smallest number of errors that is not detectable by a code. In this context, the 
error model consists of a set of error operators without specified probabilities. Typically the concept 
is used for codes on n information units and the error model consists of operators acting on any one 
of the units. For a classical binary code, the minimum distance is the smallest Hamming distance 
between two code words. 

Noise. Any unintended effect on the state of a system, particularly an effect with a stochastic component 

due to incomplete isolation of the system from its environment. 
Operator. A function transforming the states of a system. Operators may be restricted depending on the 

system's properties. For example, operators acting on quantum systems are always assumed to be 

linear. 

Pauli operators. The Hermitian matrices a^, cry and (Eq. 9) acting on qubits. It is often convenient to 
consider the identity operator to be included in the set of Pauli operators. 

Physical system. A system explicitly associated with a physical device or particle. The term is used to 
distinguish between abstract systems used to define a type of information and specific realizations, 
which are subject to environmental noise and errors due to other imperfections. 

Probabilistic bit. The basic unit of probabilistic information. It is a system whose state space consists of 
all probability distributions over the two states of a bit. The states can be thought of as describing 
the outcome of a biased coin flip before the coin is flipped. 

Probabilistic information. The type of information obtained when the state spaces of deterministic in- 
formation are extended with arbitrary probability distributions over the deterministic states. This is 
the main type of classical information to which quantum information is compared. 

Quantum information. The type of information obtained when the state space of deterministic informa- 
tion is extended with arbitrary superpositions of deterministic states. Formally, each deterministic 
state is identified with one of an orthonormal basis vector in a Hilbert space and superpositions are 
unit-length vectors that are expressible as complex linear sums of the chosen basis vectors. Ulti- 
mately it is convenient to extend this state space again by permitting probability distributions over 
the quantum states. This is still called quantum information. 

Qubit. The basic unit of quantum information. It is the quantum extension of the deterministic bit; that 
is, its state space consists of the unit-length vectors in a two dimensional Hilbert space. 

Repetition code. The classical, binary repetition code of length n consists of the two words oo . . . o and 
11 ... 1. For quantum variants of this code one applies the superposition principle to obtain the 
states consisting of all unit-length complex linear combinations of the two classical code words. 

Scalability. A property of physical implementations of information processing that implies that there are 
no bounds on accurate information processing. That is, arbitrarily many information units can be 
realized and they can be manipulated for an arbitrarily long amount of time without loss of accuracy. 
Furthermore, the realization is polynomially efficient in terms of the number of information units 
and gates used. 

States. The set of states for a system characterizes the system's behavior and possible configurations. 

Subspace. For a Hilbert space, a subspace is a linearly closed subset of the vector space. The term can be 
used more generally for a system Q of any information type: A subspace of Q or, more specifically, 
of the state space of Q is a subset of the state space that preserves the properties of the information 
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type represented by Q. 

Subsystem. A typical example of a subsystem is the first (qu)bit in a system consisting of two (qu)bits. In 
general, to obtain a subsystem of system Q, one first selects a subset C of Q's state space and then 
identifies C as the state space of a pair of systems. Each member of the pair is then a subsystem of 
Q. Restrictions apply depending on the types of information carried by the system and subsystems. 
For example, if Q is quantum and so are the subsystems, then C has to be a linear subspace and the 
identification of the subsystems' state space with C has to be unitary. 

Subsystem identification. The mapping or transformation that identifies the state space of two systems 
with a subset C of states of a system Q. In saying that L is a subsystem of Q, we also introduce a 
second subsystem S and identify the state space of the combined system LS with C . 

Syndrome. One of the states of a syndrome subsystem. It is often used more narrowly for one of a 
distinguished set of basis states of a syndrome subsystem. 

Syndrome subsystem. In identifying an information-carrying subsystem in the context of error-correction, 
the other member of the pair of subsystems required for the subsystem identification is called the 
syndrome subsystem. The terminology comes from classical error-correction, in which the syn- 
drome is used to determine the most likely error that has happened. 

System. An entity that can be in any of a specified number of states. An example is a desktop computer 
whose states are determined by the contents of its various memories and disks. Another example 
is a qubit, which can be thought of as a particle whose state space is identified with complex, two- 
dimensional, length-one vectors. Here, a system is always associated with a type of information, 
which in turn determines the properties of the state space. For example, for quantum information 
the state space is a Hilbert space. For deterministic information, it is a finite set called an alphabet. 

Twirling. A randomization method for ensuring that errors act like a depolarizing error model. For one 
qubit, it involves applying a random Pauli operator before the errors occur and then undoing the 
operator by applying its inverse. 

Unitary operator. A linear operator f/ on a Hilbert space that preserves the inner product. That is, for 
all X and y, {Ux, Uy) = {x, y). If f/ is given in matrix form, then this condition is equivalent to 
f/tf/ = 1. 

Weight. For a binary word, the weight is the number of I's in the word. For an error operator acting on 
n systems by applying an operator to each one of them, the weight is the number of non-identity 
operators applied. 
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